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Abstract — We consider the downlink of a MIMO-OFDM 
wireless systems where the base-station (BS) has M antennas 
and serves K > M single-antenna user terminals (UTs). Users 
estimate their channel vectors from common downlink pilot 
symbols and feed back a prediction, which is used by the 
BS to compute the linear beamforming matrix for the next 
time slot and to select the users to be served according to 
the proportional fair scheduling (PFS) algorithm. We consider 
a realistic physical channel model used as a benchmark in 
standardization and some alternatives for the channel estimation 
and prediction scheme. We show that a parametric method based 
on ESPRIT is able to accurately predict the channel even for 
relatively high user mobility. However, there exists a class of 
channels characterized by large Doppler spread (high mobility) 
and clustered angular spread for which prediction is intrinsically 
difficult and all considered methods fail. We propose a modified 
PFS that take into account the "predictability" state of the UTs, 
and significantly outperform the classical PFS in the presence 
of prediction errors. The main conclusion of this work is that 
multiuser MIMO downlink yields very good performance even 
in the presence of high mobility users, provided that the non- 
predictable users are handled appropriately. [] 

I. Introduction 

Multiuser MIMO downlink schemes are expected to become 
the corner-stone of future wireless cellular systems since they 
can achieve a downlink capacity that scales linearly with the 
number of base station (BS) transmit antennas M, even though 
each user terminal (UT) has only a single antenna. In order 
to accomplish this linear capacity scaling with M, accurate 
Channel State Information (CSIT) needs to be provided to the 
BS [6], [10], [5]. Following the current standardization trend of 
systems like 3GPP LTE and IEEE 802.16m (mobile WiMax) 
[2] and [1], we assume that the BS broadcasts downlink 
pilot symbols in order to allow each UT to estimate its 
own channel state vector. We assume a frequency division 
duplexing (FDD) scheme, where CSIT feedback must be 
implemented by explicit closed-loop feedback. Since fading 
channels are generally time-varying, the UTs must predict 
their channel state and feed back this prediction via some 
CSIT feedback channel (uplink). In this paper we assume an 
ideal unquantized feedback and focus uniquely on the effects 
of channel estimation and prediction: of course, any form of 
explicit CSIT feedback can only decrease the CSIT accuracy 
with respect to pure prediction and ideal feedback. 

'All authors are with the EE Department of the University of Southern 
California, Los Angeles, CA. This work was partially supported by the 
Collaborative USC-ETRI Project 53-4503-0781. 



The BS computes the downlink beamforming matrix based 
on the predicted user channels. Since in general we have 
K > M users, the BS selects a subset of S < M users that are 
served at each point in time. This selection and corresponding 
rate allocation is known as scheduling. In this paper we 
consider the widely used proportional fair scheduling (PFS) 
algorithm [12], [11]. 

Observing the performance of two rather well-known chan- 
nel prediction methods applied to the SCM channel model [3], 
which is used as a benchmark in standardization, we notice 
that channels with well separated angles of arrivals and/or very 
low maximum Doppler spread are easy to predict. In contrast, 
there exist a class of channels with high Doppler spread and 
clustered angles of arrival for which all prediction method fail. 
A more refined analysis based on the Cramer-Rao Bound [7], 
not reported here because of lack of space, shows that the 
"non-predictability" of these channels is an intrinsic property 
and not just a shortcoming of the algorithms considered. A 
naive application of the PFS scheme for these non-predictable 
users yields large performance degradation, because schedul- 
ing computed from inaccurate CSIT. We propose a simple 
modification of the basic PSF algorithm that is both very 
simple and provides significant improvements. 

For the application of the proposed modified PFS (MPFS) 
scheme it is essential that each UT informs the BS whether 
its channel is predictable or non-predictable. Fig. [T] illustrates 
a block diagram of the process run at each UTs: the UT 
keeps predicting its channel from the downlink pilot symbols. 
Also, it compares its prediction with the actual channel in the 
next time slot, and keeps track of the prediction error The 
UT notifies its predictability status to the BS by comparing 
the estimated prediction error with a suitable threshold. UTs 
change state (from predictable to non-predictable and vice 
versa) at a rate much slower than the CSIT feedback rate. 
Notice also that the non-predictable users need not feeding 
back their CSIT prediction. Therefore, as a by-product, this 
scheme lowers the requirement of the CSIT feedback channel 
by avoiding to feedback inaccurate and useless information. 

II. System Model 

The MIMO-OFDM downlink model with M BS antennas, 
one antenna at each UT and N subcarriers is given by 



Uk [t, n] = h" [t, n]x.[t, n] + Zk [t, n] 



(1) 
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Fig. 1 . Block diagram of tlie prediction and prediction error tracking process 
at each UT. 



for k = 1 , . . . , and n — 1, . . . , N, where t denotes 
discrete time (that ticks at the OFDM symbol rate), n is the 
subcarrier index, k is the UT index, yk[t,n] is the received 
signal at UT k, time t and subcarrier n, hk[t,n\ is the 
corresponding M x 1 channel vector, x.[t, n] is the transmit 
signal vector and Zk[t, n] e?\f(0, iVo) is AWGN. The channel 
vector hk[t,n] has (frequency-domain) channel coefficients 
Hk,m[t, n] for BS antenna index to = 1, ... , M. We consider 
the SCM channel model used for urban micro-cell, urban 
macro-cell and suburban macro-cell fading environments [3]. 
This model considers P clusters of scatterers where each 
cluster corresponds to a path (same relative delay) and each 
path consists of Rp subpaths. Each subpath is characterized 
by an angle of arrival and by a complex amplitude coefficient. 
After standard manipulation, the frequency-domain channel 
(dropping indices fc, to by symmetry and for the sake of 
notation simplicity) can be written in the form of 

P Rp 

H[t, n]^Y.J2 A^,pe-^2^^^"eJ^'^^'-'''* (2) 

p=l r— 1 

where Ar^p is the complex amplitude coefficient of the {r,p)- 
th subpath, (r.p is the Doppler frequency shift of the {r,p)- 
th subpath (multiplies by the OFDM symbol duration) and 
Tp is the delay of the p-th path (multiplied by the subcarrier 
spacing). 

The BS makes use of linear zero-forcing beamforming 
(ZFBF), i.e., it produces the the transmitted signal vector as 
x[t,n] = V[t,n]u[t,n] where u[t,n] is a vector of user coded 
symbols and V[t, n] is the beamforming matrix. At each time- 
frequency slot [t,n], a subset of users §[t,n] of size S[t,n] < 
M is selected according the scheduling algorithm. Therefore, 
V[t, n] has dimensions M xS[t,n]. For a given set of selected 
users §[t, n] of size S[t, n], the corresponding ZFBF matrix is 
obtained by first computing the pseudoinverse of the matrix 
H[t, n] with columns {hfc[t,n] : k G §[t,n\}, where h[i, n] 
indicates the prediction of the actual channel hk[t, n] available 
at the BS, and then normalizing the pseudoinverse columns to 
unit norm such that the average transmit power per subcarrier 
is E[|x[t,7i]p] = P. Letting Vk[t,n] denote the beamforming 
vector for user k G §[t,n], and pk[t,n] its allocated power. 



the nominal achievable rate of user k is given by 

i?fe[i,n] =log(l + |h^[t,n]vfc[i,n]|V[i,"]) (3) 

We refer to this rate as "nominal" since this is what the 
BS assumes, based on the available CSIT in the current 
slot. In the case of non-perfect CSIT, user k suffers from 
multiuser interference since ZFBF is generally mismatched. 
Here we make the optimistic assumption that, no matter how 
mismatched the beamforming matrix is , an instantaneous rate 
equal to 

i?fc[t,n]=log 1 + ' , ^ ^' ^ ^ (4) 

is actually achieved. In practice, some mechanism for fast rate 
adaptation that cope with the BS mismatched CSIT can be 
used in order to approach the actual rate. Such optimistic 
achievability assumption provides an upper bound to the 
performance with any form of rate adaptation, e.g., as obtained 
by incremental redundancy ARQ and the use of rateless codes 
[15]. 

III. Scheduling 

If the CSIT of user k is accurate, i.e., if hfc w h^, then 
Rk , «] ~ Rk [t^ n] . On the other hand, if the channel of user k 
cannot be accurately predicted, then Rk[t, n] <^ Rk[t, n] since 
the multiuser interference in the denominator of the SINR in 
dUi is large. The PFS algorithm selects at each [t, n] a user 
subset §[t,n] in order to maximize the weighted rate sum 
Efc ^Tfcjt"^ where {Rk[t,n]} are the nominal rates at time- 
frequency slot [t,n] and {Tfci^]} are the actually achieved 
long-term throughputs at time t, which are recursively updated 
according to the rule 

1 ^ 

Tk[t] - (1 - 0)Tk[t - 1] + 5] Rk[t, n] (5) 

)i=i 

where /? £ (0, 1) is the factor that controls the average 
delay. This selection can be performed, for example, by using 
the greedy user selection algorithm of [8]. Consider a "non- 
predictable" user k. For what observed above, since Tk[t] is 
updated by using the actually achievable rate, it is likely that 
Tk[t] will become very small. Hence, user k will have a large 
weight in the PFS objective function and will be served very 
frequently. It follows that non-predictable users "eat-up" a 
lot of transmission resources from the system. Unfortunately, 
since they suffer from high multiuser interference power, their 
rate will remain low. Hence, due to scheduling, the presence of 
non-predictable users impacts negatively also the performance 
of predictable users. 

In order to avoid this problem, we propose the MPFS 
scheme where users are partitioned into a predictable set of 
size Kp and a non-predictable set of size Knp = K — Kp. 
PFS is applied with the additional constraint that if fc G Knp 
is selected on slot [t,n], no other user can be selected on 
the same time-frequency slot. Furthermore, since users in the 



non-predictable class have unreliable CSIT, the BS serves 
them using standard transmit diversity (i.e., it does not try 
to beamform). The nominal rate for these users is given by 
Rk[t,n] = log ^1 + |hfc[f, n] . Notice that in this case 
only a coarse form of CSIT is needed, which in practice 
amounts to knowing the user instantaneous SNR with some 
degree of accuracy. 

For the sake of comparison, we consider also a "hard- 
fairness" scheduling (HFS) scheme that aims at maximizing 
the minimum throughput over all users, i.e., HFS solves 
maxminfeTfe. Predictable users and non-predictable users are 
served in separated time-frequency slots. Given the symmetry 
of the channel with respect to all frequencies, we can focus 
on one subcarrier and let ap and a„p = 1 — ap denote 
the fraction of time slots allocated to predictable and non- 
predictable users. Non-predictable users are served in round- 
robin using transmit diversity. Modeling the user channel 



square magnitude 



|hfcp as a Chi-square RV with 2M 



degrees of freedom, the individual average throughput of any 
of these users, by symmetry, is given by 
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Yj (M - 1)! 



>M 



(6) 



iln(l + 
[4], where r(. 



, .) is the complemen- 



where a„ (/i) = 

tary incomplete gamma function defined by [9] r(a, x) = 
t^-^e-^dt. The predictable users can be served using 
regular PFS applied in their fraction of allocated slots. The 
average rate per user can be lower bounded by considering 
equal power allocation as 



M 

— Ye 



k=l 



log 1 



M' 



feVfel 



(7) 



For i.i.d. Rayleigh fading and using the fact that jhj^v/cp is 
Chi-squared with 2 degrees of freedom, the LHS of (O takes 
on the familiar form 



M M ( M 
— e^E, 1,— 



if, 



(8) 



where Ei {n,x) = ^-p^dt. 

The max-min solution is achieved by balancing the through- 
puts. By letting apTp = (1 - ap)Tnp we find ap = ^r^^^. 
In practice, since the statistics of the channels may be non- 
Rayleigh and the application of PFS for the predictable users 
may improve Tp with respect to the above lower bound, an 
adaptive slot allocation should take care of updating the value 
of Op in the HFS scheme. Our simulations based on the SCM 
channel model show that dimensioning Up according to the 
above rule of thumb yields values very close to the actual 
rate-balancing state. 



IV. Channel Prediction Schemes 

We consider the downlink common pilot arrangement of 
Fig. 12] Pilots are separated by Dt OFDM symbols in time 
and by D f subcarriers in frequency. A total of Nt x Nf pilots 
per antenna are collected over a window of duration NfDt, 
and used for prediction. Pilots for different transmit antennas 
are mutually orthogonal. In order to avoid aliasing, we need 
that Df < and Dt < j^'— where Tmax is the maximum 
delay and C^ax is the maximum Doppler frequency shift. 

Pilot location 



_ Channel predicted 



Fig. 2. Pilot pattern. 

By defining ujr.p ~ 2TT(r,pDt, Vp = 2'KTpDf, the output of 
the pilot observation (up to an irrelevant phase shift due to the 
location of the pilot subcarriers) can be written in the form of 
a 2-dimensional sum of sinusoids in additive white Gaussian 
noise as 

p Rp 

H[qDt,mDf] =Y.J2 Ar,pe^^^'---'+''-"''> + z[l, m] (9) 

p—l r—l 

Channel prediction under this model reduces to estimating 
the parameters of the sinusoidal components. We follow the 
method of [17], based on the repeated application of ESPRIT 
(details are not included due to lack of space). In [7] we 
have also considered non-linear Least-Squares, and various 
approximations of ML estimation [13], [19], [16], and we 
concluded that, for this problem, the ESPRIT method offers 
the best tradeoff between complexity and performance. 

An alternative channel prediction scheme consists of model- 
ing the discrete-time (time-domain) channel impulse response 
as Gaussian stationary vector process and using MMSE lin- 
ear prediction (Wiener filter) [18]. The time-domain channel 
impulse response {h{t, 0), . . . , h{t, L — 1)) is related to the 
frequency-domain channel by the DFT 



H[t, n] 



L-l 

E 



h{t,l)e 



-j2TTln/N 



(10) 



Since the channel is "sampled" by the downlink pilot grid 
every Dt OFDM symbols, it follows that prediction can be 
done in steps of Dt symbols. Then, the BS treats the channel 



as piecewise constant for blocks of length Df. For simplicity, 
we assume that the time-domain channel impulse response is 
measured by sending a single impulse in time followed by 
Nf — 1 zeros, with Nf > L, with impulse power NfP. In 
this way, the frequency domain pilot grid of Fig. |2] and the 
time-domain pilot scheme have the same overhead in terms of 
pilot symbols insertion ratio and total pilot power. Both the 
ESPRIT-based and the Wiener-based schemes need to compute 
and update the sample covariance matrix of the received signal. 
This reduces to a recursive computation that is very similar to 
standard RLS. 

It is interesting to notice that, in terms of system design, the 
two prediction methods yield to quite different CSIT feedback 
schemes. Wiener prediction produces a new predicted channel 
vector every Dt OFDM symbols, and this needs to be fed 
back within this delay. In contrast, ESPRIT produces a set 
of channel parameters that is used to extrapolate the channel 
over a large block (see [17] for a quantitative estimate of the 
duration over which the channel parameters can be considered 
to be constant ). The UT will estimate the channel parameters 
{Ar^p,uJr,p,Vp} from a block of pilots of duration DfNt 
OFDM symbols, extrapolate the channel for the next block, 
and feed back any convenient quantized representation of 
the extrapolated trajectory]! Fig [3] illustrates the different 
feedback modes suited to these prediction methods. As said, 
the feedback for ESPRIT is "batch" while for Wiener it is a 
continuous low-rate transmission. 



TABLE I 

System Specifications 




Continuous feedback for Wiener 



Batch feedback for ESPRIT 



Fig. 3. Batch versus frame by frame feedback. 



V. Simulations and Discussion 

We consider a system with parameters given in Table H] 
Maximum Doppler frequency shift is given by Cmax — ^ 
where c is the speed of light, v is the speed of user and fc 
is the carrier frequency. Duration of validity of parameters is 
given by Tvaiid — ^ 3/'"^2 where Tmin is the distance from 
user to nearest scatterer and we assume that rmin = 600m. 
The corresponding values are given in Table HI] 

We have compared the two prediction methods by con- 
sidering 4 different scenarios. Our findings are summarized 
by Table [III] When Doppler frequencies are well-separated, 

^In general, feeding baclc the channel coefficient is not the optimal strategy 
for a given target CSIT distortion. A rate-distortion approach to channel state 
feedback when the channel is modeled as a coirelated Gaussian source is 
proposed in [14]. In the case of the SCM channel model, we leave this 
interesting rate-distortion problem for future work. 



Description 


Value 


1/J , Sampling irequency 


3.84MHz 


fc. Carrier frequency 


2.6GHz 


A^, Number of subcamers 


256 


i\ (I, i'lUlllLJCl \Jl ClLllVC SUUi^ttll ici a 


200 


Length of CP 


64 


A/, Subcarrier frequency spacing 


15KHz 


Tsym, OFDM symbol with CP 


83.33/^s 


Nf, Number of Pilot subcaiTiers 


50 


Nt, Number of Pilot OFDM symbols 


100 


Df, Pilot subcarrier spacing 


4 


Dt, Pilot OFDM symbol spacing 


20 


Tmax, Maximum delay 


16.67/^s 



TABLE II 

Maximum Doppler frequency and duration of validity of 
model for ut with different speeds 



speed 


i; (Km/h) 


C^max (Hz) 


j!^valid (s) 


Tvaiid (OFDM symbols) 


High 


75 


180.56 


0.231 


2700 


Low 


5 


12.04 


3.458 


41500 



the Doppler frequency components of the channel can be 
accurately estimated by ESPRIT for both high speed and low 
speed user. In contrast, when Doppler frequencies are packed, 
if user is low speed then channel varies slowly and can be 
predicted by Wiener or even treated as piecewise constant 
while for high speed users both prediction methods fail. This 
class of channels can be considered as "non-predictable", i.e., 
the accuracy of prediction is not sufficient for the purpose of 
MIMO downlink beamforming. 

Next, we consider a BS with AI = 4 transmit antennas, K = 
8 single antenna UTs with v = 75km/h (high mobility), SCM 



channel model and SNR = 20dB. K„ 



2 users (users 1 



and 2) have packed Doppler frequencies and their prediction is 
very poor for both ESPRIT and Wiener prediction and Kp = 6 
users (users 3-8) have well-separated Doppler frequencies and 
their channels are predicted reliably by applying ESPRIT Figs, 
m |5] and |6] and Table |IV] demonstrate simulation results for 
different scheduling methods. Looking at the activity fractions 
of Table IIVI i.e., the fraction of slots in which a given user 
is scheduled, we notice that the naive PFS that makes use 
of unreliable CSIT tends to serve the non-predictable users 
(users 1 and 2) very often. This confirm the behavior that 
we described qualitatively in Section [III] This problem is 
eliminated by the proposed MPFS scheme. 

We observe that the proposed MPFS scheme achieve the 
best performance both in terms of throughput sum J^k'^k, and 
in terms of sum of log of throughputs J^k ^'^S ^fe (proportional 
fairness objective function [12], [11]). 

We conclude that by treating predictable and non- 
predictable users in two classes as in the proposed MPFS 
scheme, and by using the appropriate parametric prediction 
scheme, such as the one based on ESPRIT, multiuser MIMO 
downlink can be successfully applied also to users with 
relatively high mobility. 



TABLE III 
Best prediction method 





Well- separated 
Dopplers 


Packed 
Dopplers 


Low speed user 


ESPRIT 


Wiener 


High speed user 


ESPRIT 


All fail 



TABLE IV 
Activity fractions 





1 


2 


3 


4 


5 


6 


7 


8 


PES 


0.74 


0.79 


0.35 


0.37 


0.36 


0.35 


0.36 


0.42 


MPFS 


0.14 


0.15 


0.44 


0.44 


0.44 


0.44 


0.44 


0.53 


HPS 


0.26 


0.26 


0.29 


0.29 


0.29 


0.29 


0.29 


0.35 
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Fig. 4. Evolution of [t] over time for different users. 
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Fig. 5. Sum rate for PFS, MPFS and HFS for the configuration with 2 
non-predictable and 6 predictable high-speed users. 



\ l°9(T,) 



8 r 




1 I • • ' ' • • • ' 

2000 4000 6000 8000 10000 12000 14000 16000 



Fig. 6. Sum log rate for PFS, MPFS and HFS for the configuration with 2 
non-predictable and 6 predictable high-speed users. 



